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ABSTRACT 

The young massive OB assoeiation Cygnus OB2, in the Cygnus X eomplex, is the 
elosest (~ 1400 pe) star forming region to the Sun hosting thousands of young low 
mass stars and up to 1000 OB stars, among whieh are some of the most massive stars 
known in our Galaxy. This region holds great importanee for several fields of modem 
astrophysies, sueh as the study of the physieal properties of massive and young low- 
mass stars and the feedbaek provided by massive stars on star and planet formation 
proeess. 

Cygnus OB2 has been reeently observed with Chandra/ACIS-I as part of the 
1.08 Msee Chandra Cygnus OB2 Legaey Projeet. This survey deteeted 7924 X-ray 
sourees in a square degree area eentered on Cyg OB2. Sinee a proper elassifieation 
and study of the observed X-ray sourees also requires the analysis of their optieal and 
infrared eounterparts, we eombined a large and deep set of optieal and infrared eata- 
logs available for this region with our new X-ray eatalog. In this paper we deseribe the 
matehing proeedure and present the eombined eatalog eontaining 5703 sourees. We 
also briefly diseuss the nature of the X-ray sourees with optieal and infrared eounter¬ 
parts using their position in the eolor-magnitude and eolor-eolor diagrams. 
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1. Introduction 

The study of young stellar elusters, to¬ 
gether with the eorreet elassifieation of their 
stellar eontent, generally relies on a eombina- 
tion of available multi-wavelength data, from 
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X-rays to optical and infrared. A key aspect 
of such studies of crowded stellar fields is the 
procedure adopted for merging the different 
datasets. It is important to minimize the num¬ 
ber of spurious coincidences and false nega¬ 
tives (i.e. sources in one waveband that fail 
to be matched with their real counterparts 
in another). A lack of accuracy and com¬ 
pleteness in the data merging process might 
adversely affect source classification and the 
subsequent interpretation of the results. 

When sparse catalogs are matched, the 
chances of spurious coincidences are reason¬ 
ably low. In these cases, simple matches 
based on source positions can be safely 
adopted. When the source density of one 
or more of the catalogs is high, such that 
the probability of finding more than one ob¬ 
ject within the bounds of a source position 
uncertainty is deemed significant, the use of 
more complicated methods that take into ac¬ 
count the expected multiwaveband properties 
of the source populations must be used. To 
this aim, several Maximum Likelihood meth- 
ods have been proposed in the literature (e.g.: 


Sutherland & Saundersl.ll992 ). 


Cygnus OB2 is the central massive OB 
association of the giant Cygnus X complex, 
with a rich population of young stars spread 
over an area of more than one square de¬ 
gree. Because of its very rich population 
of massive stars, Cygnus OB2 has been de¬ 
scribed as a very young globular cl uster in 
the Milky Way ( Knodlseder . 2000h . The 
census of the massive population of this 
association ranges f rom the first count by 
Reddish et al. ( 1967 ) of 300 OB members, 
to the estimate based on 2M ASS data of 
more than 2600 OB stars by IKnodlsedei 


(120001) . More recent studies found some¬ 
what lower population of massive stars and 


identified in Cygnus OB2 some of the most 
massive stars known in our Galaxy, such as 


03 stars and B supergiants dWalbornl. 11973 


Massey & ThompsonL 


1991: Comeron et al 


20021: IHansonl120031: iNegueruela et al.l.120081) . 


Despite the extinction toward Cyg OB 2 
being high due to the intervening nebulos¬ 
ity associated with the Cygnus Rift (roughly 
ranging from Ay ~ 2.5'” to Ay ~ 8'” for 


the opticallv identified members: Drew et al., 

20081: Sale et al.. 

2009: Guarcello et al.. 2012: 

Sale et al. 

2014 

), its relative proximity (~ 

1400 pc, I 

^vsletal.. 2012) has made it the 


subject of several studies aimed at under¬ 
standing its rich stellar content. Indeed, being 
the massive association with the largest mas¬ 
sive star content in the proximity (i.e. within 
2 kpc) of our Sun, with a massive popula¬ 
tion that has no equal in the other nearby 
young clusters such as the Orion Nebula 
Cluster, and begin also rich in pre-main se¬ 
quence stars (lAlbacete Colombo et al.L l2007l: 


Wright & Drakel.l2009l:lGuarcello et al.Ll2013l) . 
Cyg OB2 is also arguably the best available 
target to study star formation, disk evolution, 
and planet formation in presenc e of massive 
stars (e.g. IWright et al.l. l2014bl) . The aver¬ 
age age of the stars in the central part of the 
association has bee n estimated to range be¬ 
tween 3 and 5 Myrs ( Wright et al.l 201(i) . but 
several new star forming sites hosting a large 
fraction of very young stars still embedded in 
a contracting envelope or t hick circumstellar 


disk have been disc o vered (IVink et al.L 12008 


Wright et"^ 20121: Guarcello et al. . 2013 ). 
There are also indications that some OB stars 
in the associat ion are younger than 2 Myrs 
(IHansonl. l2003h . while a population of A stars 
found in the southern are a appears to have an 
age between 5-7 Myrs dPrew et al.l.l2008h . 

The promise of Cyg OB2 to be able to 
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shed new light on the workings and products 
of massive star forming regions motivated a 
large Chandra X-ray Observatory 1.08 Msec 
Legacy Project (IDr^d. 12014. in preparation!: 
Wright et ah . 2014ah . At ages of a few mil¬ 
lion years, stars of all masses are about three 
and four orders of magnitude stronger X-ray 
emitters compared with older populations. 
Hard X-ray photons can penetrate many mag¬ 
nitudes of visual extinction and provide an 
effective diagnostic of youth that is free from 
biases resulting from accretion from a pro¬ 
toplanetary disk and the presence of circum- 
stellar material. 

The direct aim of the survey was to use 
the selective power of X-rays together with 
the arcsecond spatial resolution of Chandra 
to perform a deep census of the stellar popu¬ 
lation and its X-ray properties, but with the 
main scientific goals of understanding the 
evolution of protoplanetary disks and star for¬ 
mation in an association approaching stellar 
supercluster dimensions. The resulting Chan¬ 
dra catalog contains 7924 X-ray sources over 
an area of ab out one square degre e centered 
on Cyg OB2 ( Wright et ah . 2014ah . Supple¬ 
mentary optical and infrared data, required to 
classify the X-ray sources and follow through 
with the scientific objectives of our survey, 
have been retrieved from available public 
surveys (SDSS/DR8, IPHAS/DR2, UKIDSS, 
2MASS) and obtained from dedicated obser¬ 
vation s with OSIR IS @GTC (Guarcello et al. ^ 
2012 ) and Spitzer ( Beerer et al. . 2010 : Guarcello ^ 
20131) . 

A crucial step in being able to use the 
available multi-wavelength catalogs consists 
of determining which objects in one catalog 
correspond to sources in another. Given the 
large stellar density in Cyg OB2, the depth of 
the OIR catalogs used, and the large fore¬ 


ground and background observed popula¬ 
tions, simple nearest neighbor approaches 
can fail because several potential counter¬ 
parts can fall within the positional uncer¬ 
tainty of a given source. More sophisticated 
approaches have employed likelihood ratio 
methods that seek to utilize other information 


parative brightness (see. 

e.g.. 

Richtei. 

1975; 

Sutherland & Saunders 


1992 


Smith et al. 


1201 ih . 


Here, we describe the matching of multi¬ 
wavelength sources to those detected in the 
Chandra Cyg OB2 survey. A brief overview 
of the X-ray, optical and infrared catalogs is 
presented in Sections |2] and [H the methods 
employed to cross-match objects in different 
catalogs are described in Section 0] and the 
final catalog is described in Sect. [51 We sum¬ 
marize the main points of the study in Section 

il 

2. The X-ray catalog 

The Chandra Cyg OB 2 Legacy survey 
design employed 36 pointings of 30 ks ex¬ 
posure each in a 6x6 raster array heavily 
(~ 50%) overlapped in order to overcome 
the Chandra lower off-axis sensitivity and 
produce a relatively uniform exposure over 
the inner 0.5 deg^ corresponding to a depth 
of 116 ks. The full survey exposure was 
1.08 Msec, it covered about 1 square degree 


etaKpntered at 20h 33m 12s -i-41 19' 00", and 
was performed over a 6-weeks period from 
January-March 2010, employing the Ad- 
vanced CCD Imagin g Spectrometer (ACIS-I; 
Garmire et al. 2003b . 


The point source catalog was constructed 
using a combination of standard CIAO pro¬ 
cessing tools, source detection algorithms. 
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and the ACIS Extract (AE; iBroos et alJl2010l) 
software package. In order to have an ho¬ 
mogeneous astrometry among the various 
Chandra pointings,ChanJra astrometry was 
re-mapped to that of the Two Micron All 
Sky Survey (2MASS; Cutri et all 2003 ) us¬ 
ing bright X-ray sources with unambiguous 
cross-matches to 2MASS objects. Source de¬ 
tection was applied to the reduced and pro¬ 
cessed Chandra data in three energy bands: 
soft (0.5-2.0 keV), hard (2.0-7.0 keV), and 
broad (0.5-7.0 keV) using different algo¬ 
rithms: An enhanced version of the Cl AO 
tool WAVDETECT that performs source detection 
on multiple non-aligned X-ray observations, 
detecting sources that may not be detected 
in th e individual observat ions, and pwde- 
TECT (iDamiani et al.l. Il997|) . This process 


was augmented by several hundred sources 
from lists of known Cyg OB2 members, in¬ 
cluding O and B-type stars dWright et ah . 
12014. submitted!) and young A-type stars 
dOrew etS^ . 2008 ). creating a total of 13,041 
source candidates. 


Candidate source photometric extraction 
and validation was performed using AE in an 
iterative fashion, whereby validated sources 
were excluded from regions used for back¬ 
ground estimation, followed by a repeat of 
the AE extraction and validation. Owing to 
the overlapping source and background re¬ 
gions in the most crowded areas of the sur¬ 
vey, several iterations of this process were 
required. The resulting X-ray catalog con¬ 
tains 7924 verified sources, the vast major¬ 
ity of which were observed at least 4 times 
in overlapping tiles, and detected within 4 
arcmin of the telescope optical axis at least 
once. The source positional uncertainty is 
typically < 0.5" and we estimate a 90% com¬ 
pleteness for stellar X-ray sources down to 


X-ray luminosity of 7 x 10^^ erg cm^ s“^ in 
the central 0.5 square degrees. A full descrip- 
tio n of the cata l og con struction is presented 
by lWright et al. ( 2014a ). while an assessment 
of the cat alog con t ents and sensitivity ar e dis¬ 
cussed bv lWrighti (12014. in preparation!) . 


3. The optical-infrared catalog 

The optical-infrared (OIR) catalog used in 
this work contains photometric data retrieved 
from several publicly available catalogs: 


• the optical catalog in r, i, z bands 
(65349 sources) obtained from obser¬ 
vations with the Optical System for 
Imaging and low Resolution Integrated 
Spectroscopy (OSIRIS), mounted on 
the 10.4 m Gran Telescopio CANARIAS 
(GTC) of the Spanish Observatorio 
del Ro que de los Much achos in Ea 


Palma ( 

Gena et al. 

2 OOOI) comniled bv 

Guarcello et al. ( 

2 OI 2 I): 


• the second release of the optical catalog 
in r', i', Ha bands (24072 sources) ob¬ 
tained from observations with the Wide 
Eield Camera (WEC) on the 2.5 m 
Isaac Newton Telescope (INT) for the 
INT Photometric Ha Survey (IPHAS, 
Drew et a02005 : Barentsen et al. 12014 ): 


• the SDSS catalog (eighth data release, 
DR8, 27531 sources, Aihara et al. . 


201 Ih in u, g, r, i, z bands; 


• the U KIDSS/GPS catalog in the JHK 
bands (iHewett et al.Ll2006l: Lucas et al. . 


20081) . containing 273473 sources, from 


observations taken with the Wide Eield 


Camera (WECAM, ICasali et al.L l2007h 
on the United Kingdom InfraRed Tele¬ 
scope (UKIRT), compiled adopting a 
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new p hotometric procedure (iKing et al. . 


2013h based on the UKIDSS images 


( Dye et al. . 20061) : 


the 2M ASS/PSC catalog in JHK (ICutri et al 


20031 43485 sources); 


the catalog in the Spitzer/IRAC 3.6, 4.5, 
5.8, 8.0//m and MIPS 24yum bands 
(149381 sources) from the Spitzer 
Legacy Survey of the Cygn us X region 
Spitzer ( Beerer et al. . 2010 ). 


As described in iGuarcello et all (1201 31) . 
these catalogs have been combined into a 
large OIR catalog containing 329514 sources. 
The matching procedure was divided into 
three steps. First, a combined optical cata¬ 
log was produced by matching the OSIRIS, 
IPHAS, and SDSS catalogs. Second, an 
infrared catalog was created by matching 
UKIDSS, 2MASS and Spitzer data. In the 
last step, the two catalogs were merged 
into an unique OIR catalog. All the data 
used here, except those from OSIRIS, are 
available over the entire area surveyed with 
Chandra/ACIS-I. The OSIRIS data are only 
available in a central 40' x 40' field. 

The OIR catalog includes stars associ¬ 
ated with Cygnus OB2 down to very low 
masses. As s uming a distance of 1.4+0.08 kpc 


(IRygl et al.L l2()12l). an average exti nction 
4.3'" ( Guarcello et al. . 2012 ). and 


adopting the isochrones of Siess et al. ( 2000[) . 
we can estimate that we have good quality 
optical and infrared data for members down 
to 0.2 Mq, allowing us an unprecedentedly 
deep and complete study of the population of 
Cygnus OB2. 


4. The adopted matching procedures 


The X-ray sources in our survey need to be 
classi fied also according to their OIR prop¬ 
erties (IKashvapl 12014. in preparationh . Erro¬ 
neous matches between the OIR and X-ray 
catalogs will result in wrong classifications, 
affecting the scientific outcome of our survey. 
For this reason, particular attention must be 
given to how the OIR and X-ray catalogs are 
merged. 


A simple matching procedure based on the 
positions of the sources and using a fixed 
matching radius (i.e. considering as real 
counterparts the OIR and X-ray pairs with a 
separation smaller than a given threshold) is 
unsuitable to our case for two reasons. First, 
the Point Spread Function (PSF) of the Chan¬ 
dra mirrors increases in size with increasing 
off-axis angle. For this reason, the positional 
accuracy of the X-ray sources is not constant 
across the field. Second, while the optical 
data are dominated by the foreground stel¬ 
lar population, and the infrared data by the 
background sources, in both cases with an 
approximately uniform spatial distribution, 
most of the X-ray sources with OIR coun¬ 
terparts are expected to be associated with 
Cygnus OB2 and clustered at the locations 
of the various subclusters of the association. 
The density of the sources not associated with 
the X-ray population (the uncorrelated pop¬ 
ulation) is high, and any attempt at match¬ 
ing the OIR and X-ray catalogs using only 
positional information will inevitably result 
in large numbers of spurious matches. It is 
necessary, then, to use a more sophisticated 
approach. 


One method used successfully in similarly 
challenging matching proc edures is based on 


Maximum Fikelihood (MF. ISutherland & Saunders 
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19921) approaches that takes into aeeount both 
the spatial separation between the different 
catalog sources (OIR and X-ray in our ease) 
and how the magnitude of the OIR sources 
compare with those expected for the real OIR 
counterparts of the X-ray sources (the corre¬ 
lated population). Several ML methods have 


been used in the literature (i.e. ITavlor et ah . 


2005 


Gilmour et al.L 120071: iRumbaugh et al 


20121) . In this work, rather than rely on a sin¬ 


gle matehing proeedure, we adopt three dif¬ 
ferent methods. The final OIR-X-ray eatalog 
will eontain all the pairs matehed with eaeh 
of the three methods, with the subsample of 
the most reliable matehes properly tagged. 


4.1. Modified I Smith et alJ (120111) proce¬ 
dure 

One of the methods tha t we adopted is de¬ 
fined in ISmith et al.l (1201 11) . slightly modified 
in order to optimize it for our speeifie multi¬ 
wavelength ease. In this approaeh, the prob¬ 
ability that a given OIR souree is the eorreet 
counterpart of a nearby X-ray source is eal- 
culated starting from the following likelihood 
ratio: 

q{m)f{r) 


LR = 


n (m) 


( 1 ) 


In this definition, /(r) is the radial distribu¬ 
tion funetion of the separations between OIR 
and X-ray pairs as a funetion of the positional 
error: 


f{r) = 


'l.TTCr po^ 


exp 



( 2 ) 


Here, r is the positional offset between OIR 
and X-ray sourees, and cTpos the positional 
uneertainties, ealeulated adding in quadra¬ 
ture the OIR and X-ray positional uneertainty. 
The quantities q{m) and n{m), i.e. the mag¬ 
nitude probability distributions of the eorre- 


lated OIR sourees and the observed magni¬ 
tude probability distribution of all the OIR 
sourees in the m band, respeetively, are de- 
seribed in the next seetions. 


4.1.1. The observed magnitude distribu¬ 
tions 


In Eq. [T]( 5 r (m) and n (m) are the probabili¬ 
ties to observe, respeetively, a eorrelated and 
a generie OIR souree with magnitude m. The 
main differenee be tween the method w e used 
and that defined in Smith et al. ( 2011 ) is that 
the latter method is applied using one optical 
catalog. Our multi-wavelength eatalog eon- 
tains data from various optical and infrared 
eatalogs, and most of the sourees laek a de- 
teetion in one or more of them. For instance, 
highly embedded or extinguished objeets in 
the baekground, or even assoeiated with the 
most obseured regions of Cygnus OB2, often 
laek optieal eounterparts. 


For this reason, we seek to use all the OIR 
information available in order to improve the 
eompleteness of the final OIR-i-X-ray cata¬ 
log. We ealeulated, then, q (m) and n (m) for 
eaeh band available in our OIR eatalog. We 
also defined for eaeh OIR souree a represen¬ 
tative band, whieh is the first one available 
and with an error smaller than 0.1™ proeeed- 
ing from shorter to longer wavelengths, start¬ 
ing from the r band. In the optieal bands we 
used preferentially the OSIRIS photometry. 
The IPHAS photometry has been used when 
OSIRIS data are not available, and SDSS 
photometry when there are no other optieal 
data. Our main eatalog in the near infrared is 
UKIDSS, while 2MASS data are used when 
UKIDSS data are not available or of bad qual¬ 
ity. This priority among the available bands 
has been arbitrary ehosen, after having ver¬ 
ified that the ehosen order was not affeet- 
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ing our results, since it was possible to de¬ 
fine a representative band for almost all the 
bands. For the vast majority of sources in our 
OIR catalog, the representative bands are the 
OSIRIS r or the UKIDSS J bands. 

To obtain n (m), we first calculated the ob¬ 
served magnitude distributions in each band 
of our catalog, sampled in bins of 0.25"* of 
width. The probability n (m) for a given OIR 
source is then given by the fraction of sources 
observed in our OIR catalog in the represen¬ 
tative band in the same magnitude bin, nor¬ 
malized by the total area of the survey. 


4.1.2. The correlated magnitude distribu¬ 
tions 

The calculation of q{m) for each OIR 
source is more complicated, since it requires 
the computation of the magnitude distribution 
of the expected correlated population, after 
considering and removing the contribution 
fro m the uncorrelated population. Follow¬ 
ing [SmithetalJ (1201 ih . the initial approxima¬ 
tion of the expected magnitude distribution 
of the correlated population is obtained from 
all the OIR sources closer than 10" to the X- 
ray sources (hereafter the nearby population). 
Even at large off-axis angles, this matching 
radius is significantly larger than the propa¬ 
gated positional uncertainty, resulting in a se¬ 
lection of 78182 nearby OIR sources for the 
7924 X-ray sources. The expected magni¬ 
tude distribution of the correlated population 
is obtained from that of the nearby popula¬ 
tion, by subtracting in each magnitude bin 
the number of uncorrelated sources expected 
to match the positions the X-ray sources and 
falling in the given bin of magnitude: 


q{m) = nearby {m)—N {m)y.N, (3) 

^tot 


where nearby (m) is the magnitude distri¬ 
bution of the nearby sources, Nx is the total 
number of X-ray sources (7924), Amatch is the 
matching area with a radius of 10", Atot is the 
total area of our survey (1 square degree), and 
N (m) is the observed magnitude distribution 
n{m) X Atof By using this formula, we are 
also assuming that the uncorrelated sources 
are uniformly distributed in the survey area, 
which may be incorrect in case of not uniform 
extinction such as in Cyg OB 2. 

Fig. [T] shows the magnitude distributions 
in the OSIRIS r and UKIDSS J bands for 
the entire OIR catalog, the nearby sources, 
and the expected correlated population. In the 
optical band there is not much difference be¬ 
tween these distributions, not even in the faint 
part. This may indicate that this method is 
not very effective in removing fortuitous co¬ 
incidences between X-ray sources and faint 
background optical sources. The correction 
used to remove the uncorrelated population 
has been more effective in the J band, as 
demonstrated by the difference between the 
total distribution, centered at 7 = 19.5™, and 
the expected correlated distribution, centered 
in the range 16™ < 7 < 17™. 

4.1.3. Reliability associated with an OIR-\-X- 
ray pair 

Once we have calculated / (r), q (m), and 
n (m), we can obtain LR for each pair of X- 
ray and OIR sources from Eq [TJ Using the 
value of LR, we can assign a probability that a 
given OIR source is the real counterpart of the 
nearby X-ray source, and compare it with a 
chosen threshold (see Sect. 14.1.41) . This prob¬ 
ability is calculated comparing the observed 
LR value with a distribution of LR values ob¬ 
tained from 200000 test X-ray sources uni¬ 
formly distributed across the field. The use 
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Fig. 1.— Magnitude distributions in the OSIRIS r (left panel) and UKIDSS J (right panel) bands 
for the entire OIR eatalog (solid histogram), the nearby sources (dotted histogram), and the ex- 
peeted eorrelated distributions (dashed histograms) using Eq. [3](SM method) 


of uniform spatial distribution of the X-ray 
sourees is an aeeeptable approximation sinee 
the OIR eatalog, dominated by baekground 
NIR sourees, has a nearly uniform spatial 
density. These test sourees were matehed 
with the OIR eatalog, obtaining a distribu¬ 
tion of simulated LR values from more than 
70000 pairs (the exponential form in /(r) 
outs any matoh between sourees more dis¬ 
tant than few areseoonds). The reliability as- 
soeiated with eaeh mateh between the X-ray 
and OIR sourees in our eatalog, whieh is, by 
definition, the probability that the given OIR 
souree is the real oounterpart of the nearby 
X-ray souree, is then ealoulated as: 


where is the reliability that the OIR souree 
i is the real oounterpart of the X-ray souree 
y; Nsim is the number of simulated LR val¬ 
ues; Ngt is the number of simulated LR values 
larger than the one observed between the i j 
pair: 

Ngt = N [LR,i,nui > LRi^ (5) 

In this way eaeh ij pair (i.e. eaeh pair of X- 
ray and OIR sourees) has an assooiated prob¬ 
ability that the OIR souree is the real oounter¬ 
part of the X-ray souree. 

4.1.4. Match results 


(4) 
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The last step oonsists in assigning a prob¬ 
ability out-off, i.e. to deeide what is the mini¬ 
mum reliability that identifies real matehes. 
























































This was performed by studying how the 
number of spurious matehes out of the total 
number of matches increases with decreasing 
the cut-off. To obtain the number of spurious 
matches, we repeated the matching proce¬ 
dure after “randomizing” our X-ray catalog, 
i.e. applying rigid translations of 1' to the 
X-ray sources four times, each time with a 
different combination of positive and nega¬ 
tive rigid translations in RA and DEC. The 
number of expected spurious matches corre¬ 
sponding to given test values of the cut-off is 
the mean of the number of matches obtained 
with these four “randomized” X-ray catalogs. 
We then fixed our cut-off value as the one for 
which the ratio of spurious to total matches is 
~ 10% (corresponding to Rcut-off = 0.95). 

Fig. [2] shows how the number of total and 
spurious matches, together with their ratio, 
vary with the test thresholds. With the chosen 
cut-off of 0.95, we matched 5180 pairs, with 
4946 single matches. Hereafter, this method 
is called the SM method. 

4.2. Matching procedure with the corre¬ 
lated population from an accurate 
position match. 

The second matching procedure is based 
on a different definition of the correlated pop¬ 
ulation. The definitions of LR (Eq. [T]), of 
the observed magnitude distributions n (m), 
as well as that of the reliability (Eqs. |4]and 
|5]) and the procedure to define the threshold 
(Sect. 14.1.41) are the same as those adopted in 
the previous sections. 

In Sect. 14.1.21 we calculated the magni¬ 
tude distribution of the expected correlated 
population starting from a position match 
between the X-ray and OIR catalog with a 
large matching radius (10"), and then we 
used a statistical approach to remove from 



Fig. 2.— Number of real (solid line) and spu¬ 
rious (dashed line) matches obtained with in¬ 
creasing the reliability threshold in the SM 
method. The numbers over the solid line 
show how the fraction of spurious matches 
decreases with increasing the threshold 

the nearby population the expected contribu¬ 
tion of the uncorrelated OIR sources. This 
correction was necessary, as proved by the 
very large number of nearby sources found 
(78182) and the risk that the SM method may 
not be very effective in removing spurious 
coincidences between X-ray and faint optical 
sources in the background. However, since 
the chances that OIR sources nearby X-ray 
positions are real counterparts decrease with 
increasing separation, a different estimate of 
the correlated population can be found with a 
nearest-neighbor match using suitable small 
matching radii estimated with a detailed sta¬ 
tistical analysis of real and spurious matches. 
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4.2.1. The correlated population from accu¬ 
rate nearest-neighbor match 


In this procedure, we obtained a corre¬ 
lated population from whieh we derived q (m) 
from an aeeurate nearest-neighbor mateh. 
The matehing radius used in this proeedure 
must take into aeeount the degradation of the 
PSF in the X-ray images at inereasing off- 
axis angles and the photon statistics of the 
X-ray source. In order to do that, we used 
an individual matehing radius for eaeh X-ray 
souree whieh is proportional to the X-ray po- 
sitional uncertain ty, the latter ealeulated as in 


Kim et al.l ( 2007h : 


fmatch — .A X O'pgs (6) 


log(crpos) = 0.11370 - 0.46/og(C) - 0.2398 

(7) 

log[o-po,) = O.lO310-O.1945Zog(C)-O.8O34 

( 8 ) 

where r^atch is the individual matehing radius, 
A is a coefficient to be evaluated, apos is the 
positional uneertainty, 0 is the off-axis angle 
of the X-ray sourees and C is the n e t num - 
ber of eounts. Following Kim et ahl ( 2007 ). 
Eq.|7]is applied to sourees with less than 133 
eounts; Eq. [8]to brighter sourees. 


Given the different depth and spatial dis¬ 
tribution of the optieal, JHK (UKIDSS and 
2MASS), and Spitzer eatalogs, we deeided to 
perform the nearest-neighbor mateh for each 
of these three eatalogs separately and then 
to merge the results. The erueial step here 
is to estimate the eoeffieients A for the three 
eatalogs, together with a minimum allowed 
matehing radius (r^^). The use of is nee- 
essary sinee the positional errors in the center 
of the ACIS field are very small, resulting in 
unaeeeptably low matehing radii. 


The proeedure adopted to ealeulate these 
parameters is similar to that defined in Seet. 14.1.41 


i.e. by eomparing the ratio of the numbers 
of spurious eoineidenees to that of the to¬ 
tal matehes obtained with inereasing test val¬ 
ues of A and The spurious eoineidenees 
are ealeulated by matehing the OIR catalogs 
with “randomized” X-ray catalogs (as in Sect. 
14.1.41) : while the total number of matehes by 
eombining the OIR catalogs with the “real” 
X-ray eatalog (i.e. with no positional offset 
added). We first evaluated A and then in 
both oases as the largest test values at whieh 
the spurious matehes reaohed ~ 10% of the 
real matehes. TableHJlists the values of A and 
r^in found for the optieal, JHK and Spitzer 
catalogs, together with the total number of 
matehes. The eatalog of the expeoted corre¬ 
lated sourees obtained by merging the results 
of these three nearest-neighbor matches num¬ 
bers 5820 sourees, many less than the nearby 
sources defined in Sect. 14.1.21 and compa¬ 
rable to the final number of the OIR-i-X-ray 
pairs matohed in the merged eatalog (Seet. 
[5]). This eatalog has been used to define the 
magnitude distribution q (m) used in Eq. [TJ 
Eig. [3] shows the distributions of the 
OSIRIS r and UKIDSS J magnitudes for 
all the OIR sourees and for the expeeted 
correlated population found with the aeeu¬ 
rate nearest-neighbor mateh deseribed here. 
Comparison with Eig. [T] reveals that the 
magnitude distributions obtained with this 


Table 1: Results of the close-neighbor 
matehes 


OIR eatalog 

^min 

A 

N. matehes 

Optical 

0.6" 

1.4 

4917 

JHK 

0.5" 

0.9 

5025 

Spitzer 

0.5" 

1.3 

5278 
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method are shifted for brighter magnitudes: 
The r distribution of the correlated pop¬ 
ulation in Fig. [3] is eentered in the range 
20"® < r < 22'", while in Fig. [Hit is peaked 
at about r ~ 24'” similar to the distribution 
of the total OIR catalog. The effect in the J 
distribution is smaller. This indicates that this 
method is more effective than the SM method 
in removing candidate spurious matches with 
faint sources. 

The rest of the procedure (i.e. the cal¬ 
culation of the reliability and the threshold 
for reliable matches, equal to 0.96) was per¬ 
formed as described in Sects. ITO] and l4~L4l 
and resulted in 5210 matches (4933 single 
matches). Hereafter, this method is called the 
PM method. 


4.3. Naylor et al. 2013 Bayesian method 


A Bayesian technique to cross correlate 
X-ray catalogs wit h deep infrared data has 
been developed by iNaylor et al.l (12013h . as 
an extension of the techniq ues defined in 
Sutherland & Saunders ( 1992 ). This method 
has several similarities with the maximum 
likelihood methods described above. As in 
those methods, the magnitude of the candi¬ 
date OIR counterpart is compared with an 
expected magnitude distribution for the cor¬ 
related population to reduce the chances of 
matching the X-ray sources with an uncorre¬ 
lated background or foreground source. 

We defer a descri ption of the method 


to iNaylor et al.l (12013h . though note that it 
brings two significant improvements to ex¬ 
isting match criteria such as those we used 
to define our ML methods. First it has a 
more sophisticated method of estimating the 
magnitude distribution of the correlated pop¬ 
ulation, which allows for the fact that the 
presence of counterparts in the error circles 


means they are more crowded than the field. 
The second improvement lies in accounting 
simultaneously for all the OIR sources close 
to an X-ray position to calculate the proba¬ 
bility that each of them is the real counter¬ 
part of an X-ray source. This is not the case 
with the methods described above, where the 
reliability is calculated independently from 
the values obtained for the other nearby OIR 
sources. The main gain of the more sophis¬ 
ticated approach is in reducing chances of 
multiple source matches. 

We applied the method using the /-band, 
X-band, and [3.6] photometry, where the 
number of available sources with good pho¬ 
tometry is larger, merging the various cata¬ 
logs available for each band using the hier¬ 
archy described in Section 14.1.11 Since the 
matching method requires a magnitude for 
each star, we removed from the catalog all 
objects which did not have magnitudes in the 
band in question, or a magnitude whose un¬ 
certainty was greater than 0.3'”. 


The first stage of the matching process is 
to check the model of the X-ray error circles 
by comparing the distribution of stars around 
all the X-ray positions with that predicted by 
a model consisting of a uniform background 
and a set of counterparts. The latter were 
initially assumed to have a Gaussian distri¬ 
bution about the X -ray positions wit h the ra¬ 
dius given by the iKim et al.l (l2007b model. 
We found the data were best fitted by mul¬ 
tiplying the radius of the error circles by 0.6 
and adding in quadrature a position and X- 
ray flux independent unce rtainty of 0-2". The 
combination suggests the iKim et al.l (l2007b 
radii over-estimate the positional uncertainty 
in our data by perhaps 20 percent. We also 
explored systematic offsets in the data, and 
found then best fit corresponded to a shift in 
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Fig. 3.— Magnitude distributions in the OSIRIS r (left panel) and UKIDSS J (right panel) bands 
for the entire OIR eatalog (solid histogram), and for the OIR sourees matehed with the X-ray 
sourees using an individual matehing radius as deseribed in Seet. 14.2. 1[ i.e. in the PM method 
(dashed histogram) 


RA of 0.02" with no shift in deelination. 


The remainder of th e matehing proeess 
proeeeded as deseribed in lNaylor et al.l (120 13b . 
and resulted in a list of all stars whieh had a 
likelihood of being a eounterpart greater than 
0.05. The eounterpart probabilities are pre¬ 
sented along with those from the other meth¬ 
ods in Table |2l We ineluded in the list of 
eounterparts only those stars whose likeli¬ 
hoods exeeeded 0.8. We ean estimate the eon- 
tamination in this sample by summing all the 
likelihoods that a given star is not the eoun¬ 
terpart, and then dividing this by the number 
of eounterparts. For both the K and i band 
samples with likelihoods greater than 0.8 this 


gives a eontamination rate of about 2%. 

The total number of sourees matehed 
with this method is 5157. Considering only 
the single matehes, the number of sourees 
matehed by this method is similar to those 
matehed by the two ML methods: 4933 
sourees with the PM method, 4946 with SM, 
and finally 4958 with this method, ealled 
hereafter the NBF method. 

5. The final OIR+X-ray catalog 
5.1. Reliable matches 

The eatalog merged from all the above 
methods numbers 5703 OIR-i-X-ray pairs 
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matched with at least one method: 4643 
are matched with all three methods, 558 
with two, and 502 with just one. Among 
the matched sources, there are 5398 single 
matches and 305 multiple matches. Fig. |4] 
shows the r vs. r - i diagram of all the op¬ 
tical sources with good photometry within 
the Chandra Cygnus OB2 Legacy field. The 
co nditions for good pho tometry are defined 


m 


Guarcello et al.l (120121) : In short, they re¬ 
quire small errors (cr^ < 0.1™ and cr,._; < 
0.15™) in at least one of the three opti¬ 
cal catalogs we have used. Also shown in 
Fig. |4| are t he iso chrones and ZAMS from 
Siess et al. ( 2000l) . plotted usi n g a d istance 


of 1.4 + 0.08 kpc (iRvgl et al.l. l2012l) . with 


the extinction Ay = 4.3™ fo r the isochrones 
and A y = 1™ for the ZAMS ( Guarcello et ah . 


20121) • In the right side of the diagram are 


marked the r magnitudes of 3.5Myrs old 
stars with Ay = 4.3™ and d = 1.4 + 0.08 kpc 
with different masses. The extinction vec- 
tor is obta i ned fr om the extinction law of 
O’Donnelll (Il994 . The sources matched 


with only one method are overplotted with 
different symbols and colors, as explained 
in the legend. The sources matched only 
with the NBF method lie mainly in the clus¬ 
ter locus (i.e. in this diagram the area be¬ 
tween the isochrones), spanning all the mag¬ 
nitude range but with a larger fraction of 
faint sources. Following the reddening vec¬ 
tor, these sources can be stars associated 
with Cyg OB2 suffering large extinction, or 
low mass members observed during a pe¬ 
riod of high X-ray activity. The analysis 
of their X-ray spectra and light curves will 
help us to be tter classify them ( Flaccomi^ 
in preparationh . The sources matched only 


with the PM method populate mainly the re¬ 
gion of the diagram with intermediate magni¬ 
tudes. Most of them are compatible with the 



r-i 


Fig. 4.— r vs. r - i diagrams with all the 
sources with good optical photometry marked 
with gray dots, the isochrones with age < 
lOMyrs and the ZAMS pl otted using the 


distance and extinction as in Guarcello et al. 


( 20121) . The reddening vector and the masses 
corresponding to the 3.5 Myrs isochrone are 
also shown. Different symbols and colors 
mark the X-ray sources matched with just one 
method. 


cluster locus. The sources matched only with 
the SM method lie only in the faint end of 
the cluster locus. The fact that all the sources 
matched only by the SM method are faint is 
likely a consequence of the lower effective¬ 
ness of the SM method in removing spuri¬ 
ous matches with faint optical uncorrelated 
sources (Fig. [T])- These samples show the 
same properties in all the optical and infrared 
color-magnitude diagrams. 

Fig. |5] shows the J magnitudes of the OIR 
counterparts of the matched sources vs. their 
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Fig. 5.— J magnitude of the OIR counter¬ 
parts vs. the separation between the OIR 
and the X-ray counterparts for the sources 
matched only with the NBF or the ML meth¬ 
ods. 

separation from the X-ray sources, for the 
pairs matched only with the NBF method and 
those matched only with the ML methods. 
In this diagram the differences between these 
two samples are evident: the stars matched 
only with the NBF method are systemati¬ 
cally fainter and closer, indicating that the 
ML methods (mainly PM) has been very con¬ 
servative in removing candidate spurious co¬ 
incidences with close faint counterparts. On 
the other hand, the ML methods are more ef¬ 
fective in matching OIR counterparts with in¬ 
termediate and bright magnitudes and at large 
separations (i.e. larger than 1"), which are 
plausible given the size of the ACIS PSF. 

Table |2] shows a description of the merged 
OIR and X-ray catalog. The catalog contains 
79 columns: The first 55 show the optical 


and infrared photometry of the OIR counter¬ 
part, followed by seven columns with some 
of the X- ray properties of the X-ray counter¬ 
part from lWright et al.l (l2014an . The columns 
from the 63th to the 72th contain tags indi¬ 
cating the matching procedure that matched 
the two counterparts, with the related prob¬ 
abilities, while the last columns information 
on the multiple matches. As explained ear¬ 
lier, the 305 multiple matches are kept in the 
merged catalog. Different scientific uses of 
the catalog require different treatment of the 
multiple matches. For instance, studies based 
on stellar positions may simply consider the 
multiple matches as a single entry at a given 
position; or studies based on the photometric 
properties would require to discard the mul¬ 
tiple matches. For those cases where one 
of the matching counterparts in the multiple 
matches must be chosen, we provide in the 
catalog a column indicating the “best coun¬ 
terpart”. Users must use this column and deal 
with multiple matches with caution. Given 
the evidence that the SM method is not ef¬ 
ficient in removing spurious coincidences be¬ 
tween the X-ray and faint optical sources, and 
the fact that the expected magnitude distri¬ 
bution of the correlated stellar population is 
calculated with a statistical approach which 
is not fully suitable for our survey, we con¬ 
sider as “reliable matches” the 5619 sources 
matched by either the NBF (the NBF sam¬ 
ple) or PM (the ML sample, i.e. removing 
the sources matched only by the SM method) 
method. 


5.2. Properties of the X-ray+OIR sources 


Guarcello et al.l (120 13h studied the disk 


population of Cyg OB 2 within the field of 
the Chandra Cygnus OB2 Legacy Survey. 
They selected and classified 1843 stars with 
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Table 2: List and description of the columns of the optical, infrared and X-ray catalog of the 
Chandra Cygnus OB2 Legacy Survey 


Columns 

Description 



1 

Sequential ID 



2-3 

Right Ascension and Declination in J2000 (projected on 2MASS) 



4-9 

OSIRIS photometry in riz OSIRIS bands 



10-15 

IPHAS photometry in riHa IPHAS bands 



16-25 

SDSS photometry in ugriz SDSS bands 



26-31 

2MASS photometry in JHK 2MASS bands 



32-35 

2MASS quality flags ph_qual, rdjlg, bLflg, ccjlg 



36-45 

UKIDSS photometry in JHK plus the J - K and H - K colors 



46-53 

IRAC photometry at 3.6, 4.5, 5.8, 8.0/im 



54-55 

MIPS photometry in 24 pm 



56 

X-rav ID from Wright et al. (2014a') 



57 

exposure time of the X-ray source in sec 



58-60 

net, total, and background counts of the X-ray source 



61 

median photon energy observed 



62 

separation between the X-ray and the OIR counterparts in arcsec 



63 

tag equal to 1 if the pair is merged by the PM method 



64 

tag equal to 1 if the pair is merged by the SM method 



65 

tag equal to 1 if the pair is merged by the NBF method using i 



66 

tag equal to 1 if the pair is merged by the NBF method using K 



67 

tag equal to 1 if the pair is merged by the NBF method using [3.6] 



68 

probability that OIR and X-ray source are real counterparts from the PM method 


69 

probability that OIR and X-ray source are real counterparts from the SM method 


70 

probability that OIR and X-ray source are real counterparts from the NBF 

method 

using i 

71 

probability that OIR and X-ray source are real counterparts from the NBF 

method 

using K 

72 

probability that OIR and X-ray source are real counterparts from the NBF 

method 

using [3.6] 

73 

multiple OSIRIS-IPHAS matches (0 if a single match) 



74 

multiple UKIDSS-2MASS matches (0 if a single match) 



75 

multiple NIR-Optical matches (0 if a single match) 



76 

multiple NIR-l-OPT-Spitzer matches (0 if a single match) 



77 

multiple OIR-X-ray matches (0 if a single match) 



78 

equal to 1 if it is a preferred counterpart (always 1 in single matches) 



79 

equal to 1 if it is a reliable match 
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disks associated with Cyg OB2. Among 
these stars, a total of 444 have an X-ray 
counterpart: 368 class II sources, 10 candi¬ 
dates with transition disks and 19 with pre¬ 
transition disks; 20 candidate accretors with 
intense Ha emission; 6 blue stars with disks 




(IGuarcello et all 120101) : and 16 candidate 
class I sources. We also detected the X-ray 
emission from 52 known O stars, 57 known 
B stars, and 6 emiss ion line objects selected 
by I Vink et al.l (120081) . Other existing classi¬ 
fications of low-mass members of Cyg OB2 
made use of X-ray observations which are 
part of the Chandra Cygnus QB2 Legacy 


Proje ct (lAlbacete Colombo et al.L120071: IWright & Drakd. 
20091) . so that they are included in the list 
of candidate members produced by our sur¬ 
vey. Fig. shows the r vs. r - i diagram of 
the optical counterpart of the X-ray sources 
with “reliable” OIR matches. Their locus in 
this diagram is well delimited by the chosen 
isochrones, which correspond to the locus 
of the candidate members with circumstel- 


lar disks selected by IGuarcello et al.l (120131) . 
This suggests that this sample is dominated 
by young stars associated with Cyg OB2. 
Optical-X-ray counterpart and disk-bearing 
sources show a slightly different distribu¬ 
tion in r, being the former more numerous 
at bright magnitudes. This can be understood 
as a consequence of: 1) The fact that low- 
mass stars holds their inner disks for longer 
time than high-mass stars; 2) the X -ray cat¬ 
alog is not complet e below 1 M© dWrighti 
2014. in preparation!) : 3) the bright end of 
the r distribution is more populated by can¬ 
didate foreground stars detected in X-rays. 
For instance, a significant population of field 
stars apparently older than Cyg OB2 stars 
lies in the bright blue part of the diagram (i.e. 
r < 15.5™ and r - i < 0.4™). These stars have 
an extinction significantly smaller than that 



Fig. 6.— r vs. r - i diagrams with all the X- 
ray sources with a “reliable” OIR counterpart 
with good optical photometry marked with 
gray dots, and the Cyg OB2 members with 
circumstellar disk marked with black dots. 
The isochrones, reddening vector, and masses 
are plotted as in Fig. |4] 


of the Cyg OB 2 stars, as inferred from other 
color-color diagrams. 

Figg. |7] and [8] show several optical and 
infrared color-color diagrams of the OIR 
sources with X-ray counterparts (black dots) 
classified as “reliable matches”, together with 
all the sources meeting the criteria for good 
photometry in the relevant bands (i.e. errors 
in colors smaller than 0.15™). In the r - i 
vs. i - z diagram most of the X-ray sources 
with optical counterparts lie in the area 
delimited by the 3.5 Myr isochrones from 
Siess et al. ( 2000 ) with extinction Ay = 2.6™ 
and Ay = 5.6™, which are the 10% and 90% 
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quantiles, resp ectively, of the o ptical extinc¬ 
tion found in Guarcello et al. ( 2012 1. The 
optical-l-X-ray sources to the left of the less 
extinguished isochrone are likely foreground 
sources, while possible background sources 
cannot be distinguished from the faintest 
stars associated with Cygnus OB2. These 
properties are only weakly dependent on the 
isochrones used. 

Another diagram where the foreground 
population can be easily distinguished from 
Cyg OB2 members is the r' - Ha vs. r' - i' 
diagram. The soli d lines are the ZAMS from 
Drew et al.l(l2005b with£»_'/ = 0™, 1™, 2™, 3™. 
The X-ray sources which lie close to the 
Eb-v = 0"' ZAMS are mainly in the fore¬ 
ground. Other classes of sources that can be 
distinguished are the background giants that 
mainly lie in the l ower part of the dia gram, 
below the ZAMS ( Wright et ah . 2008 ). and 
accreting stars which show very red r-Hcr col¬ 
ors. Only a handful of IPHAS-l-X-ray sources 
lie in this part of the diagram. Moreover, 
candidate A stars are expected to populate 
the locus in this diagram with in the dashed 
curved lines ([Drew et al.l. 120081) . 

The loci shown in the three infrared di¬ 
agrams in Fig. [F] (i.e. the Giants, Disk, 
and Galaxies loci) have been defined in 
Guarcello et al. ( 2013 ). In the J-H vs. H-K 
diagram only a few NIR-l-X-ray sources lie in 
the disk locus or at very high extinction. The 
first result is not surprising, since only 7.5% 
of the selected stars with disks in Cyg OB 2 
populate this locus (iGuarcello et al.L l2013h : 
the second result suggests that the back¬ 
ground contamination of the X-ray sources 
with NIR counterparts is low. Very small con¬ 
tamination from galaxies is also suggested 
by the [3.6] - [5.8] vs. [4.5] - [8.0] di¬ 
agram, which is one of the diagrams used 


in IGuarcello et al.l (1201 3l) for selecting disks 
and galaxies. In this diagram, the popula¬ 
tion of X-ray sources with IR excesses due 
to the presence of a circumstellar disk lie 
inside and nearby the disk locus in the up¬ 
per right part. Most of the X-ray sources 
with MIPS counterparts have intrinsic red 
colors, likely due to circumstellar disks, as 
shown in the [4.5] - [5.8] vs. [5.8] - [24] 
diagram. In all the discussed diagrams, the 
reddening vectors are taken from the extinc ¬ 
tion laws found bylRieke & Lebofskvid 19851) : 
O’Donnell ( 1994 ): Flaherty et al. ( 2007 ). 

In Fig. |9]we show the spatial distribution 
of all the X-ray sources with OIR counter¬ 
parts, which are clearly clustered in the center 
of the field that roughly cor responds whh the 


central cluster of Cyg QB2 (iBica et al.L 12003 


Guarcello et al.Ll2013[) . but there is also a rich 
sparse population across the entire field. The 
contours mark the emission at 8.0yum, show¬ 
ing the locations of the most dense nebular 
structures. 

The analysis of the X-ray properties of 
the X-ray sources with OIR counterpart is 
beyond the scope of this paper, and it will 
be part of dedicated studies dFlaccomioL 
in preparation! : lKashyapll20 14. in preparation!) 
A brief comparison of the observed X-ray 
photons energy between the X-ray sources 
with and without OIR counterpart is shown 
in the next section. 

5.3. Properties of the unmatched X-ray 
sources 

The different nature of the X-ray sources 
with and without OIR counterparts is evi¬ 
dent by comparing their spatial distribution 
and their median photon energy, shown in 
Figg. [To! and [TT] The distribution of the X- 
ray sources with OIR counterpart is peaked 


17 



























































Fig. 7.— Optical color-color diagrams with all the sources with good photometry in the involved 
eolors (gray dots) and the “reliable” counterparts of X-ray sources (black dots). In the r - i vs. i - z 
diagram the solid lines are 3.5 Myrs isoehrones with Ay = 2.6'” and Ay = 5.6'”; in the r' - Ha vs. 
r' - i' the solid lines are ZAMS with inereasing Eg^y from 0 to 4. 


at about 1.65 + 0.15keV. The distribution of 
the X-ray sources with no OIR eounterpart is 
completely different, being flatter, shifted to¬ 
ward higher energies and approximately bi- 
modal, with a high energy eut-off at about 
3.45 +0.15 keV. The two distributions ean be 
understood if the former is dominated by stars 
associated with Cyg OB2, while the latter is 
dominated by baekground and extragalactic 
sourees, mainly active galaetic nuclei, with 
also a small presenee of possible less extin¬ 
guished stars. The right panels show the dis¬ 
tributions of the net eounts for the two sam¬ 
ples of X-ray sourees. An evident exeess of 
faint X-ray sources is observed among the 
X-ray sources with no OIR eounterpart with 
compared to those with counterpart. 

Comparing the spatial distribution of the 


unmatehed X-ray sources (Fig. [TOl) with that 
of the OIR+X-ray stars (bottom right panel in 
Fig. |9l), it is evident that the latter show a high 
degree of elustering in the center of the field, 
as expeeted, while the former are almost uni¬ 
formly distributed. For instance, in the cen¬ 
tral area, within 8' from the median position 
of all the X-ray sources, fall the 24% of the 
X-ray+OIR sources, and only the 13% of the 
X-ray sources with no OIR eounterpart. In 
Fig. [To] we do not observe a strong eoneen- 
tration of sourees toward the most dense neb¬ 
ular structures, suggesting that the number 
of very extinguished members of Cyg OB 2 
among the unmatched X-ray sources is low. 
We expeet that a significant number of spuri¬ 
ous X-ray detections are in the area around 
Cygnus X-3, approximately at the position 
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Fig. 8.— NIR color-color diagrams with all the sources with good photometry in the involved 
eolors (gray dots) and the “reliable” eounterparts of X-ray sourees (blaek dots). The loei shown 
in the diagrams distinguish the typieal eolors expeeted from giants and normal stars at various 
extinetion, disk-bearing stars, and baekground galaxies. In the right panel, giants ean be found 
aeross all the diagram, exeept the loeus marked with “No giants”. 


marked as “X-3” in Fig. [TOl unmatehed X-ray souree versus their angu- 

Fig. [m shows the diagram of the J mag- lar separation. As expeeted, the distribution 

nitudes of the elosest OIR sourees to eaeh of sourees in this diagram is almost eomple- 
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X—ray sources with OIR counterpart 



Mean observed energy (keV) 

X-ray sources without OIR counterpart 


X—ray sources with OIR counterpart 



X-ray sources without OIR counterpart 



Mean observed energy (keV) 



Fig. 11.— Distributions of the median photon energy (left panels) and net eounts (right panels) of 
the X-ray sourees with (upper panels) and without (lower panels) OIR eounterparts. The vertieal 
dotted lines in the left panels mark the median values of the distributions. 


mentary to those in Fig. [51 The vast major¬ 
ity of the sourees here have large separations, 
with the exeeption of 206 X-ray sourees with 
an OIR souree eloser than 1 aresee. Of these, 
143 are faint in J (having J > 18'”), and only 
18 have a brighter J. For 45 of these sourees 
the J photometry is absent or of poor quality. 

6. Conclusions 

In this paper we eompile a eatalog of X- 
ray sourees with optieal and infrared eoun¬ 
terparts of the massive star forming region 
Cygnus OB2. The X-ray eatalog is ob¬ 
tained from the 1.08 Msee Chandra Cygnus 
OB2 Legaey Survey, eovering an area of 1 
square degree eentered on Cyg OB2. The 
X-ray eatalog eontains 7924 sourees. The 
optieal-infrared eatalogs have been eompiled 


by merging several deep eatalogs available 
for this region: From observations taken 
with OSIRIS @GTC (in riz bands), the pub- 
lie eatalogs SDSS/DR8 (in ugriz bands), 
IPHAS/DR2 (in r'i'Ha bands), UKIDSS/GPS 
and 2MASS/PSC (JHK bands), and from the 
Spitzer Legaey Survey of the Cygnus X re¬ 
gion (in IRAC and 24.0/rm bands). This eat- 
alog eontains 329514 sourees in the square 
degree area observed with Chandra/ACIS-I. 


We diseuss how a simple nearest-neighbor 
mateh would result in a highly eontaminated 
eatalog, with an exeess of false positives 
and spurious matehes. To avoid this, we 
adopted three different matehing proeedures, 
two based on a Maximum Likelihood ap- 
proaeh plus t he Ba yesian method defined in 


Navlor et al.l (l2013h . The differenee between 
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Fig. 9.— Spatial distribution of the X-ray 
sources with reliable OIR counterparts. The 
red solid lines mark the emission contours at 
8.0/zm. 

the two Maximum Likelihood methods con¬ 
sist in how the expected magnitude distribu¬ 
tion of the optical-infrared sources which are 
real counterparts of the X-ray sources. In one 
method this is obtained from a statistical ap¬ 
proach from the observed magnitude distri¬ 
bution of the optical-infrared source nearby 
the X-ray positions. In the latter, with using 
an accurate closest-neighbor match. All these 
three methods have been used and the results 
are merged in a single unique catalog. This 
merged catalog contains 5703 sources with 
X-ray and optical/infrared counterparts. We 
show that the most reliable optical-infrared 
counterparts are found with the Bayesian 
method and the Maximum Likelihood which 
uses the closest-neighbor match to define 
the expected correlated magnitude distribu- 


Fig. 10.— Spatial distribution of the X-ray 
sources without OIR counterparts, with over¬ 
plotted the emission contours at 8.0 jjm. 

tion, counting 5619 matches. The nature of 
these sources is discussed using optical and 
infrared color-color and color-magnitude dia¬ 
grams: The vast majority are compatible with 
being associated with Cygnus OB2; a low 
contamination from candidate background 
galaxies and giant stars is observed, while 
the foreground population looks to be more 
significantly represented. Our combined cat¬ 
alog also contains 444 stars with disks, 52 O 
stars, and 57 B stars. 

This paper is based on an extensive dataset: 
optical data provided by observations made 
with the Gran Telescopio Canarias (GTC), in¬ 
stalled in the Spanish Observatorio del Roque 
de los Muchachos of the Institute de As- 
troffsica de Canarias, in the island of La 
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Fig. 12.— Distribution of the J magnitude 
of the elosest OIR sources to the unmatched 
X-ray sources vs. their angular separation. 

Palma; data from the IPHAS survey, based 
on observations carried out at the Isaac New¬ 
ton Telescope, INT, operated on the island 
of La Palma by the Isaac Newton Group in 
the Spanish Observatorio del Roque de los 
Muchachos of the Institute de Astrofisica de 
Canarias; data from the SDSS Data Release 
9, funded by the Alfred P. Sloan Foundation, 
the Participating Institutions, the National 
Science Foundation, and the U.S. Department 
of Energy Office of Science; data products 
from the Two Micron All Sky Survey, which 
is a joint project of the University of Mas¬ 
sachusetts and the Infrared Processing and 
Analysis Center/California Institute of Tech¬ 
nology, funded by the National Aeronautics 
and Space Administration and the National 
Science Foundation; data based on observa¬ 
tions made with the Spitzer Space Telescope, 
which is operated by the Jet Propulsion Lab¬ 


oratory, California Institute of Technology, 
under contract with NASA; and data obtained 
as part of the UKIRT Infrared Deep Sky Sur¬ 
vey (UKIDSS), which used the UKIRT Wide 
Field Camera (WFCAM). The authors also 
made an extensive use of the software TOP- 
CAT to manipulate the used catalogs. 
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